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Abstract: Image compression is an extended research area 
from a long time. Various compression schemes were 
developed for the compression of image in gray scaling and 
color image compression. Basically the compression is 
focused for lossy or lossless color image compression based 
on the need of applications. Where lossy compression are 
higher in compression ratio but are observed to be lower in 
retrieval accuracy. Various compression techniques such as 
JPEG and JPEG-2K architectures are developed to realize 
such and method. But with the need in high accuracy 
retreivation these techniques [8] are not suitable. To achieve 
nearby lossless compression!!] various other coding methods 
were suggested like lifting scheme coding. This coding result 
in very high retrieval accuracy but gives low compression 
ratio. This limitation is a bottleneck in current image 
compression architectures. So, there is a need in the 
development of a compressing approach where both higher 
compression as well as higher retrieval accuracy is obtained. 

Keyword: Image Compression, Context Modeling, Arial 
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I. INTRODUCTION 

Digital imagery has had an enormous impact on 
industrial, scientific and computer applications. Image 
coding has been a subject of great commercial interest in 
today's world. Uncompressed digital images require 
considerable storage capacity and transmission bandwidth. 
Efficient image compression solutions are becoming more 
critical with the recent growth of data intensive, 
multimedia-based web applications. An image is a 
positive function on a plane. The value of this function at 
each point specifies the luminance or brightness of the 
picture at that point. Digital images are sample versions of 
such functions, where the value of the function is specified 
only at discrete locations on the image plane, known as 
pixels. During transmission of these pixels the pixel data 
must be compressed to match the bit rate of the network. 

In order to be useful, a compression algorithm 
has a corresponding decompression algorithm[3] that, 
given the compressed file, reproduces the original file. 
There have been many types of compression algorithms 
developed. These algorithms fall into two broad types, 
loss less algorithms and lossy algorithms. A lossless 
algorithm reproduces the original exactly. A lossy 

algorithm, as its name implies, loses some data. Data loss 
may be unacceptable in many applications. For example, 



text compression must be lossless because a very small 
difference can result in statements with totally different 
meanings. While processing with documented images 
such as digitized map images the compression required are 
to be totally lossless, as a minor variation in the image 
data may result in a wrong representation of the map 
information in the case of roads, vegetation, dwelling etc. 
Various research works were carried out on both lossy and 
lossless image compression[l] in past. The image 
compression committee has come out with the JPEG 
committee with a release of a new image-coding standard, 
JPEG 2000 that serves the enhancement to the existing 
JPEG system. The JPEG 2000 implements a new way of 
compressing images based on the wavelet transforms in 
contrast to the transformations used in JPEG standard. 
There is a majority of today's Internet bandwidth is 
estimated to be used for images and video transmission. 
Recent multimedia applications for handheld and portable 
devices place a limit on the available wireless bandwidth. 

II. STATISTICAL IMAGE CODING 

A binary image can be considered as a message, 
generated by an information source. The idea of statistical 
modeling is to describe the message symbols (pixels) 
according to the probability distribution of the source 
alphabet (binary alphabet, in our case). Shannon has 
shown that the information content of a single symbol 
(pixel) in the message (image) can be measured by its 
entropy: 

H pixel = — Log 2 P , 
Where P is the probability of the pixel. Entropy of the 
entire image can be calculated as the average entropy of 
all pixels: 

H image = - i=l Log2 Pf t 

Where Pi is the probability of i* pixel and n is the total 
number of pixels in the image. 

If the probability distribution of the source alphabet (black 
and white pixels) is a priori known, the entropy of the 
probability model can thus be expressed as: 

H = -P w Log 2 P w - P B Log 2 P B , 

Where P w and P B are the probabilities of the white and 
black pixels, respectively. The more sophisticated 
Bayesian sequential estimator calculates probability of the 
pixel on the basis of the observed pixel frequencies as 
follows: 
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Where n l w , n l B are the time -dependent counters, p l w , Pb 
are the probabilities for white and black colors 
respectively, and 5 is a constant. Counters n l w and iib start 
from zero and are updated after the pixel has been coded 
(decoded). As in [JBIG1], we use 5 = 0.45. The 
cumulative equation for entropy is used to estimate the 
average bit rate and calculate the ideal code length. 

III. CONTEXT BASED STATISTICAL MODEL 

The pixels in an image form geometrical structures 
with appropriate spatial dependencies. [2] The 
dependencies can be localized to a limited neighborhood, 
and described by a context-based statistical model. In this 
model, the pixel probability is conditioned on the context 
C, which is defined as distinct black-white configuration 
of neighboring pixels within the local template. For 
binary images, the pixel probability is calculated by 
counting the number of black (n c B ) and white (n c w ) pixels 
appeared in that context in the entire image: 



c 
Pw 1 



[p c B - 



-P c w , 



if x = white 



if x = black 



Here, p c B and p c w are the corresponding probabilities of 
the black and white pixels. The entropy H(C) of a context 
C is defined as the average entropy of all pixels within the 
context: 

H(C) = - pw log 2 Pw - Pb log 2 Pb 

A context with skew probability distribution has 
smaller entropy and therefore smaller information content. 
The entropy of an N-level context model is the weighted 
sum of the entropies of individual contexts: 

H N = - Y, N j=i P(Cj). (pw Q . log 2 Pw Q + Pb Q log2 Pb Q ) 
In principle, a skewed distribution can be obtained through 
conditioning of larger regions by using larger context 
templates. However, this implies a larger number of 
parameters of the statistical model and, in this way, 
increases the model cost, which could offset the entropy 
savings. Another consequence is the "context dilution" 
problem occurring when the count statistics are distributed 
over too many contexts, thus affecting the accuracy of the 
probability estimates. 



IV. ALGORITHM 

The context size is a trade-off between the 
prediction accuracy and learning cost (in dynamic 
modeling) or model overhead (in semi-adaptive 
modeling). A larger template size gives us a theoretically 
better pixel prediction. This results in a skewer probability 
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distribution and lower bit-rates. 

CONSTRUCTION PROCEDURE 

To construct a context tree, the image is 
processed and the statistics n l w and n l B are calculated for 
every context in the full tree, including the internal nodes. 
The tree is then pruned by comparing the children and 
parents nodes at each level. If compression gain is not 
achieved from using the children nodes instead of their 
parent node, the children are removed from the tree and 
their parent will become a leaf node. The compression 
gain is calculated as: 

Gain ( C, C Wy C B )= l(C)-l( C w ) - l(C B ) - Split Cost , 
Where C is the parent context and C w and C B are the two 
children nodes. The code length / denotes the total number 
of output bits from the pixels coded using the context. The 
cost of storing the tree is integrated into the Split Cost 
parameter. The code length can be calculated by summing 
up the entropy estimates of the pixels as they occur in the 
image: /( C) = X? log p f ( C) The probability of the pixel is 
calculated on the basis of the observed frequencies using a 
Bayesian sequential estimator: 

nw(C)+ S 

P t{c)- """" 



Pw 



[C)+2S 



if / th pixel is white 



,Pb(& = 1; 



if t th pixel is black 



Where n w , n B are -dependent frequencies, and 

pV, Pb are me probabilities for white and black colors 
respectively, and 5 = 0.45, as in [JBIG1]. 
The template form and the pixel order in this example are 
optimized for topographic images. 




Figure: 1. Illustration of a context tree. 

In the bottom-up approach, the tree is analyzed from the 
leaves to the root. A full tree of k Max levels is first constructed 
by calculating statistics for all contexts in the tree. The tree is then 
recursively pruned up to level k M i n , using the same criterion as 
in the top-down approach. The gain is calculated using the 
code length equation using /(C). The code lengths from the 
children contexts /(C w ) and /(C B ) are derived from the 
previous level of the recursion. The sub-trees of the nodes 
that do not deliver positive compression gain are removed 
from the tree. A sketch of the implementation is shown in 
Figure and the algorithm is illustrated in Figure. 
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Figure: 2. Illustration of bottom- up tree pruning. 

The bottom-up approach can be implemented 
using only one pass over the whole image. Unfortunately, 
high k MAX values will result in huge memory 
consumption. For this reason, a two-stage bottom-up 
pruning procedure was proposed in. In the first stage, the 
tree is constructed from the root to level K S tart and then 
recursively pruned until level k M in. In the second stage, the 
remaining leaf nodes at the level K START are expanded up 
to level kMax and then pruned until level K START ' In this 
way, the memory consumption depends mainly on the 
choice of the K START because only a small proportion of 
the nodes at that level remains after the first pruning stage. 
The starting level K START is chosen as large as the memory 
resources permit. This approach of modeling & pruning 
results in a higher compression of pixel representation in 
given image sample. 

V. SYSTEM DESIGN 

The suggested context modeling for color image 
compression is developed for the evaluation of Arial map 
images. These images are higher in textural variation with 
high color contrast. A compression scheme for such an 
image is designed and the block diagram for this method 
is as shown below, 



Tree — » Pinning 
Modeling 



Rcbieved 



Dntii 



This designed system read the color image 
information and transform to gray plane with proper data 
type conversion for the computation. During pre 
processing operation the map image is equalized with 
histogram equalization to normalize the intensity 
distribution to be in uniform level. This equalized image is 
then processed with binarization by using thresholding 
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technique to estimate the required region out of the whole. 
A mathematical morphology[10] operation is apply to 
extract the bounding regions and curves in the map image. 

The extracted regions are then passed to context 
tree modeling [7] for obtaining the context feature of the 
obtained regions. A tree model is developed as explained 
in previous sections for the obtained context features. The 
contexed features are quite large in count and are required 
to be reduced for higher compression. 
To achieve lower context counts a pruning operation is 
performed. A Pruning is a hierarchical coding technique 
for dimensionality reduction with a tracing of obtained 
context features in a hierarchical tree manner with 
branches and leaf projecting towards high dominative 
context features comparative to lower context features 
discarding at intermediate level. This pruning process 
hence ends out at minimum number of dominative context 
features resulting in low context information for context 
mapping. This results in faster computation of image data 
mapping with context feature for entropy encoding. 

Context mapping is a process of transforming the 
image coefficient to a context tree model mapping [9] with 
reference to obtained pruning output. The image pixels are 
mapped with pruning output and a binary stream 
indicating the mapping information is generated. This 
stream is passed to entropy encoder for performing binary 
compression using Huffman entropy coding. [4] [14] 

The dequantized information is passed to the 
demapping operation. The image coefficients are retrieved 
with a demapping of the context information to the 
dequantized data to obtain the original image information 
back. The demapping operation is carried out in a similar 
fashion as like the mapping operation with the reference of 
context table. For the regeneration of pixel coefficient the 
same context tree is used for the reverse tree generated to 
retrieve pixel coefficient. The obtained coefficients are 
post processed for the final generation of the image. 

This unit realigns the pixel coefficient to the grid 
level based on the generated context index during pre 
processing. The image retrieved after the computation is 
evaluated for PSNR value in the evaluator unit for the 
performance evaluation of suggested architecture with 
estimation accuracy as the quality factor. 
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VI. RESULTS 

For the processing of a developed system Arial captured 
map images with color information is passed. The images 
are passed in a higher resolution to attain best context 
extraction so as to maintain highest accuracy. The images 
are taken in a JPEG format and passed to the processing 
unit to compute histogram diagram for the normalization 
and further processing. 



Figure: 5. Histogram plot for the given query Image 




Figure: 8. Threshold Image Figure: 9. Extracted Image 



filled image Isolated Regions 




Figure: 10. Filled Image Figure: 11. Context Features 




Figure: 12. The Hierarchical Tree Modeling 
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PERFORMANCE OBSERVATIONS 

A performance evaluation is given for calculation 
of retrieval accuracy and compression ratio by evaluation 
of PSNR and percentage of compression ratio for the 
given query samples. 

Performance Observation for 8 & 24-Bit Depth 
of Sample- 1 




In 8- bit depth graph at low bit level compression is high 
for the context tree method because it is a hierarchical 
context tree where as existing JPEG2000 has very low 
compression but for larger data JPEG2000 almost tried to 
reach context method. In 24-bit depth also gives the same 
information. 




In 8- bit depth graph at low bit level PSNR value for the 
context tree method is in between 65 to 75 % where as 
existing JPEG2000 has very low PSNR value in between 
10 to 20 %, but for 24-bit depth JPEG2000 is reached to 
20 to 25 %. 




Figure: 13. Plots of Context Method and JPEG2000 

In 8- bit depth graph at low bit level computational time 
for the context tree method is high compare to JPEG2000 
has very low computational time, but for larger data 
JPEG2000 has taken very high computational time where 
as context tree has taken low computational time. 
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OBSERVATION TABLES 

Image Bit Depth: 8 - Bit Depth 



Sample 


6- Bit 
Depth 


Context Method 


JPEG-2000 


Dimension 


Compression 
Ratio 


PSNR 


Computational 
Time 


Compression 
Ratio 


PSNR 


Computational 
Time 


SI 


44*64 


56.33 


65.61 


0.70 


24.82 


12.39 


0.73 


110*110 


59.34 


71.35 


12.5 


40.04 


13.07 


1.78 


128*126 


59.34 


73.56 


12.5 


46.51 


13.32 


2.37 


256*254 


60.11 


71.60 


18.7 


54.06 


15.22 


12.8 


512*512 


60.74 


72.27 


51.5 


59.77 


18.51 


130.9 


S2 


64*64 


56.33 


65.61 


9.30 


24.82 


12.39 


10.73 


110*110 


59.34 


71.35 


9.30 


40.04 


13.07 


1.92 


128*126 


59.34 


73.56 


10.9 


46.51 


13.32 


2.40 


256*256 


60.11 


71.60 


18.7 


54.06 


15.22 


12.8 


512*512 


60.74 


72.27 


54.6 


59.77 


18.51 


129.7 


S3 


64*64 


56.33 


65.61 


9.30 


24.82 


12.39 


0.73 


110*110 


59.34 


71.35 


9.30 


40.04 


13.07 


1.89 


128*126 


59.34 


73.56 


12.5 


46.51 


13.32 


2.31 


256*256 


60.11 


71.60 


18.7 


54.06 


15.22 


13.01 


512*512 


60.74 


72.27 


57.6 


59.77 


18.51 


133.6 



Image Bit Depth: 24 - Bit Depth 



Sample 


24- Bit 
Depth 


Context Method 


JPEG-2000 


Dimension 


Compression 
Ratio 


PSNR 


Computational 
Time 


Compression 
Ratio 


PSNR 


Computational 
Time 


SI 


64*64 


58.33 


75.80 


12.5 


37.03 


15.90 


0.671 


110*110 


59.34 


76.19 


10.9 


46.93 


14.49 


1.73 


126*128 


59.34 


76.85 


10.9 


52.57 


16.69 


2.23 


256*256 


60.11 


74.92 


1.67 


56.07 


18.13 


12.4 


512*512 


60.74 


75.45 


54.2 


57.98 


21.75 


130.6 


52 


64*64 


58.33 


75.80 


9.30 


37.03 


15.97 


0.671 


110*110 


59.34 


76.19 


12.0 


46.93 


16.49 


1.72 


126*128 


59.34 


76.85 


12.5 


52.57 


16.69 


2.34 


256*256 


60.11 


74.92 


23.4 


56.07 


18.13 


12.46 


512*512 


60.74 


75.45 


54.2 


57.98 


21.75 


130.9 


S3 


64*64 


58.33 


75.80 


7.60 


37.03 


15.97 


0.75 


110*110 


59.34 


74.19 


12.5 


46.93 


16.49 


1.78 


126*128 


59.34 


74.85 


14.0 


52.57 


16.69 


2.29 


256*254 


60.11 


74.92 


21.6 


56.07 


18.13 


13.26 


512*512 


60.74 


75.45 


54.2 


57.98 


21.75 


131.2 



□ 



vn. CONCLUSION 

This Paper presents a compression approach in image 
processing applications using a hierarchical context 
modeling of highly varying color images with practical 
information of Arial mapping. The developed context tree 
modeling is focused with the objective of attaining 
minimum error and faster computations in processing 
these mapping images for real time applications. For the 
developed system the quality metric of situation accuracy 
with respect to PSNR value is computed and observed to 
be a higher value giving suggested method as feasible 
solutions for fast, high and lossless compression in 
practical environments. 
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